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The experimental search for new thermoelectric materials remains largely confined to a limited set of success¬ 
ful chemical and structural families, such as chalcogenides, skutterudites, and Zintl phases. 1 3 In principle, 
computational tools such as density functional theory (DFT) offer the possibility of rationally guiding exper¬ 
imental synthesis efforts toward very different chemistries. However, in practice, predicting thermoelectric 
properties from first principles remains a challenging endeavor,® and experimental researchers generally do 
not directly use computation to drive their own synthesis efforts. To bridge this practical gap between exper¬ 
imental needs and computational tools, we report an open machine learning-based recommendation engine 
( http://thermoelectrics.citrination.com) for materials researchers that suggests promising new thermoelectric 
compositions, and evaluates the feasibility of user-designed compounds. We show that this engine can identify 
interesting chemistries very different from known thermoelectrics. Specifically, we describe the experimen¬ 
tal characterization of one example set of compounds derived from our engine, RE 12 C 05 B 1 {RE = Gd, Er), 
which exhibits surprising thermoelectric performance given its unprecedentedly high loading with metallic d 
and / block elements, and warrants further investigation as a new thermoelectric material platform. We show 
that our engine predicts this family of materials to have low thermal and high electrical conductivities, but 
modest Seebeck coefficient, all of which are confirmed experimentally. We note that the engine also predicts 
materials that may simultaneously optimize all three properties entering into zT; we selected RE\ 2 Co 5 Bi for 
this study due to its interesting chemical composition and known facile synthesis. 
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I. INTRODUCTION 

For any materials problem, breaking out of “local op¬ 
tima” in composition space to discover entirely new 
chemistries remains a notoriously difficult challenge. 5 
Many of the most notable materials classes under inves¬ 
tigation today-from Na„CoC >2 derived thermoelectric^ 
to iron arsenide superconductors 7 -were discovered for¬ 
tuitously. As a result, experimental efforts often gravitate 
toward incrementally improving known chemistries (via 
doping, nanostructuring, etc.), as these efforts are more 
likely to bear fruit than high-risk searches through chem¬ 
ical whitespace for entirely new materials. 

The consequence of research communities’ focus on 
further exploitation of known chemistries rather than 
exploration of unknown chemistries is that much of 
composition space simply remains uncharacterized. In 
Fig.JTJi, we illustrate the remarkable chemical homogene¬ 
ity of most thermoelectric materials investigated to date. 
We plot each material from the thermoelectric database 
of Gaultois et a?.® on the periodic table based on the 
composition-weighted average of the positions of ele¬ 
ments in the material. The tight cluster of previously 
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investigated chemistries is, as expected, dominated by 
chalcogenides and p-block elements such as Sn and Sb. 
In contrast, we also show the positions of Gdi 2 Co 5 Bi and 
Eri 2 Co 5 Bi, materials derived from our recommendation 
engine, which we characterize as a new class of ther¬ 
moelectrics in this work. These materials are almost 
pure intermetallics, in sharp contrast to thermoelectric 
compounds investigated to date (Fig.[lj5) . The objective 
of our recommendation engine is to directly enable ex¬ 
perimental researchers to rapidly identify new materials, 
such as REi 2 Co 5 Bi, that are very distinct from known 
compound classes, and worthy of further study. 


A materials recommendation engine 

Our recommendation engine is a machine learning- 
based approacfl2l!2l f or efficiently driving synthetic ef¬ 
forts toward promising new chemistries. We have trained 
a machine learning model to make a confidence level 
prediction of whether the (1) Seebeck coefficient, (2) 
electrical resistivity, (3) thermal conductivity, and (4) 
band gap of input materials are within acceptable ranges 
for thermoelectric applications. We define these ranges 
as follows: (1) IS) > 100/iVK -1 ; (2) p < 10 -2 H cm; 
(3) k < 10 W m^ 1 K _1 ; and (4) E g > 0 eV, all at room 
temperature. For each range, the engine gives a confi¬ 
dence score between 0 and 100% that a given material’s 
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FIG. 1. (a) Most known thermoelectric materials lie in a tight cluster in composition space (black and blue dots; blue dots have 
chemical formulae explicitly labelled). The recommendation engine presented here allows the identification of new thermoelectric 
materials families that are well outside the existing composition space of common systems; in particular, we report the charac¬ 
terization of RE 12 C 05 B 1 {RE = Gd, Er; orange squares) in this work, which are chemically and structurally distinct from known 
thermoelectrics, (b) The strongly intermetallic AEj^CosBi compounds we report here lie far outside the norm for metal loading 
among collected thermoelectric compositions in the Gaultois et ai.® database. 


measured value for that property at room temperature 
will fall within the targeted range. We would classify 
any material for which the answer to all these questions 
is likely “yes” as a potentially promising thermoelectric 
that may warrant further study. The purpose of our rec¬ 
ommendation engine is thus neither to make quantita¬ 
tive predictions of these thermoelectric properties, nor 
to definitively identify record-setting compounds-these 
remain open challenges for future work. Rather, the en¬ 
gine is intended to greatly augment the chemical intu¬ 
ition of experimental researchers working on materials 
discovery. In particular, we have found that our model’s 
ability to screen vast numbers of possible compositions 
and short-list interesting candidates can inspire materi¬ 
als syntheses that would not have been obvious a priori. 


II. METHODS 

Modelling and informatics 

Here we describe the approach used to construct the 
recommendation en gine. Our engine is an example of 
materials informatics ]—^ or the application of empirical 
machine learning methods to the prediction of materials 
behavior. Any machine learning approach for materials 
relies on three key ingredients: training data, descrip¬ 
tors, and choice of algorithm. Training data are the ex¬ 
ample sets from which the machine learning approach 
should extract meaningful chemical trends. Descriptors 
are the low-level characteristics of materials (e.g., crys¬ 
tal structure, chemical formula, etc.) that might corre¬ 
late with materials properties of interest. Specifically, de- 
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scriptors are either numerical (e.g., average atomic num¬ 
ber Z ) or categorial (e.g., crystal structure=perovskite) 
variables that enable us to “vectorize” materials in such 
a way that they become amenable to machine learn¬ 
ing techniques. Finally, learning algorithms interrogate 
descriptor-vectorized training data for relevant patterns. 

In this work, the training set comprises a large body of 
both experimental thermoelectric characterization dateP 
and first principles-derived electronic structure data.® 13 
These data are publicly available via the Citrination plat¬ 
form (http://www.citrination.com) and/or the Materi¬ 
als Project API (http://www.materialsproject.org/open). 
These data consist of the Seebeck coefficients, thermal 
conductivities, electrical conductivities, and band gaps 
measured for thousands of materials as a function of 
temperature and a variety of other metadata conditions. 
Our model uses these input data to learn interesting 
chemical trends that could be exploited to design new 
materials. As large, high-quality training data sets are 
scarce in materials science relative to the biological sci¬ 
ences, where bioinformatics has become a standard tool, 
we urge the materials community to consider contribut¬ 
ing to data infrastructures (Citrination, Materials Project, 
NIST’s DSpace repository, EU’s NoMaD, and others) that 
together will significantly expand open access to data for 
materials researchers. 

Descriptors are the second key ingredient in materi¬ 
als informatics. The scientific literature around design¬ 
ing descriptors for materials has grown substantially in 
just the past several yearsfSED Indeed, recent work has 
shown that the predictive power of machine learning 
models for materials is strongly dependent upon the se¬ 
lected descriptor setJ-23 Our engine relies upon a tuned 
blend of descriptors designed in-house and drawn from 
a variety of sources.® By way of example, as materials 
scientists, we recognize that the periodic table contains 
a tremendous amount of information about how the el¬ 
ements behave and interact. We thus pre-bias our ma¬ 
chine learning models with such knowledge (e.g., the 
d, block of the periodic table is metallic; Li and Na are 
chemically very similar but not identical; and the lan¬ 
thanides behave similarly in ionic compounds). This step 
allows us to create predictive models with data sets that 
have thousands (rather than tens or hundreds of thou¬ 
sands) of examples. 

Finally, our recommendation engine is built using the 
so-called random forest algorithm. 17 This algorithm con¬ 
structs a large number of decision trees, all trained on 
slightly different subsets of the training data. Random 
forest is an ensembling technique, which takes advan¬ 
tage of the fact that a collection of “weak” learners such 
as decision trees can, in concert, model extraordinarily 
complex nonlinear behavior. An example rule that a sin¬ 
gle decision tree might learn is that if a material con¬ 
tains two elements with very different electronegativities 
(e.g., Na and Cl), that material is likely to have a large 
band gap. Of course, the thermoelectric phenomena we 
seek to model here are substantially more subtle, and 


thus a large random forest of decision trees is useful in 
untangling the underlying physics. We refer the reader 
elsewhere®® for more detailed discussions and tutori¬ 
als on how to apply random forests to materials data. 


Model validation 

We visualize the accuracy of our recommendation en¬ 
gine’s predictions in Fig. [2] which represents the results 
of leave-one-out cross-validation (LOOCV) on our train¬ 
ing data (in the case of the band gap data, we performed 
LOOCV on a subset of the extremely large training set). 
In the LOOCV procedure, if we have n total measure¬ 
ments of a particular property such as thermal conduc¬ 
tivity, we train our machine learning model on n — 1 of 
these values and predict the nth (left out) value. We per¬ 
form one training step and prediction for each property 
value, and present the error distribution for all n values 
in Fig. [2] The error distribution then provides us with 
a sense of how we may expect the model to perform on 
new materials of which we have no prior knowledge. 

Fig. [2] indicates that our engine generally makes very 
reliable assessments of thermoelectric materials proper¬ 
ties. The modes of the error distributions are in each case 
close to 0. For each property, the engine’s errors skew to¬ 
ward false negatives (resistivity, band gap, thermal con¬ 
ductivity) or false positives (Seebeck), which reflects the 
fact that the underlying training data do not contain 
equal fractions of positive and negative examples. See¬ 
beck coefficients prove most difficult to assess (i.e., the 
error distribution for that property has the largest stan¬ 
dard deviation), likely because there are strikingly dif¬ 
ferent mechanisms that underpin the values, for exam¬ 
ple, strongly correlated oxides as opposed to degenerate 
semiconductors. Owing to the difficulty in assessing the 
Seebeck coefficient, initial predictive models using only 
the electrical resistivity, thermal conductivity, and See¬ 
beck coefficient produced too many candidates that were 
good metals with poor Seebeck coefficients. To remedy 
this shortcoming and provide more robust recommen¬ 
dations, the band gap was added as a secondary met¬ 
ric, where we determine the probability whether a given 
composition will have a non-zero bandgap. 


Experimental details 

RE 1 2 Co 5 Bi {RE = Gd, Er) samples were made by arc¬ 
melting freshly filed Er or Gd pieces (99.9%, Hefa), Co 
powder (99.8%, Cerac), and Bi powder (99.999%, Alfa 
Aesar). Stoichiometric mixtures (0.5 g total mass) with 
5-7% excess of bismuth were pressed into pellets and 
melted twice in arc-melting furnace under argon atmo¬ 
sphere (Edmund Buhler Compact Arc Melter MAM-1). 
The total mass loss after melting was < 1%. The sam¬ 
ples were sealed in silica tubes and annealed at 1070 K 
for one week, then quenched in cold water. To pro- 
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FIG. 2. Leave-one-out cross validation error histograms for the four key properties estimated by our recommendation engine: (a) 
Seebeck coefficient; (b) electrical resistivity; (c) thermal conductivity; and (d) band gap. For each material in our training set 
and each property, the recommendation engine gives a confidence score between 0 and 1 that the property value falls within the 
ideal windows we have defined for thermoelectric applications. Errors approaching +1 represent false negatives (our engine was 
extremely confident the material would be poor for that property, but the property is actually good); and an error of — 1 is a false 
positive (our engine was extremely confident the material would be good for that property, but the property is actually poor). 
The peak around 0 for each property shows that the engine generally gives confidence values very close to unity for materials 
possessing properties in the desired ranges, or close to zero for materials whose property values fall outside the target range. 


duce enough material for physical property measure¬ 
ment, ~70 samples of each compound were prepared, 
and pure samples were combined by melting into a sin¬ 
gle ingot of ~5 g, which was sanded to yield the appro¬ 
priate geometry (either a rectangular bar, or a cylinder). 
Density was measured using Archimedes’ method; the fi¬ 
nal pellets had densities 100% of the single crystal values 
(hGd 12 co 5 Bi = 8.6g/cm 3 , PEr 12 co 5 Bi = 9.9g/cm 3 ). 

Powder X-ray diffraction patterns were collected us¬ 
ing an INEL CPS 120 diffractometer with CuKqi ra¬ 
diation at room temperature, and Rietveld refinement 
was used to confirm the structure and phase purity 
(see Supporting Information). Backscatter electron 
microscopy and elemental analysis via energy disper¬ 
sive X-ray spectroscopy (EDX) were performed with a 
JEOL JSM-6010LA InTouchScope scanning electron mi¬ 
croscope. Backscatter microscographs reveal the sam¬ 
ples are largely compositionally homogeneous (see Sup¬ 
porting Information). Quantitative elemental analysis 
on several polished pieces found an atomic composi¬ 
tion of Gd69(2)Co26(2)Bi5(2) which is in a good agree¬ 
ment with expected .RE^CosBi composition. Er^Co.gBi 
samples were not appropriate for quantitative analysis 
because of overlaping Co Kct (6.924 keV) and Er La 


(6.947 keV) lines. 

High-temperature thermoelectric properties (electrical 
resistivity and Seebeck coefficient) were measured with 
an ULVAC Technologies ZEM-3. Sample bars had ap¬ 
proximate dimensions of 9 mm x 4 mm x 4 mm. Measure¬ 
ments were performed with a helium under-pressure, 
and data was collected from 300 K to 800 K through three 
heating and cooling cycles over 18 hours to ensure sam¬ 
ple stability and reproducibility. 


III. DISCUSSION 

In this work, we are interested not only in develop¬ 
ing a model that gives accurate predictions of materials 
properties, but also in making it immediately accessible 
and useful for experimental researchers. To that end, 
we have published our recommendation engine as a web 
app at http://thermoelectrics.citrination.com, where re¬ 
searchers may explore a pre-computed list of around 
25,000 known compounds (representing a sizable sub¬ 
set of the Inorganic Crystal Structure Database, or ICSD), 
and also use our model to evaluate in real-time their own 
materials candidates. In this way, we hope that the app 
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serves as a rapid triage tool for ideas for potential new 
thermoelectric materials. Our pre-computed list may be 
arranged according to the probabilities associated with 
any one of the four properties we are modelling, and is 
sorted by default according a composite score that takes 
all four properties into account. Furthermore, the user 
may specify cutoff thresholds for any of the properties, 
and thereby greatly reduce the size of the list. 

As we believe our extensive precomputed list contains 
some interesting and heretofore uncharacterized candi¬ 
date thermoelectric materials, we now comment on a 
select set of high-ranking compounds. Several of these 
compounds are given in Table [IJ 

TaVOs and TaPO.-, occur in an analogous crystal struc¬ 
ture to the phosphate tungsten bronzes . l9 dM These ma¬ 
terials can be expected to have good thermoelectric per¬ 
formance given the heavy atoms, the potential for low 
electrical resistivity provided by the repeating ReO ;i -type 
structural network that is highly connected in three di¬ 
mensions, and the intrinsic crystallographic shear pro¬ 
vided by the crystal structure. Although the phosphate 
tungsten bronzes themselves are not highly rated, their 
metallic electrical transport properties are encouraging 
for structural analogues®! Moreover, TaV0 5 has a neg¬ 
ative coefficient of thermal expansion and a structural 
transition at 600° C® This structural transition may lead 
to softening of phonon modes and anharmonic scat¬ 
tering, which may lead to low thermal conductivity. 
The second material of interest we present is Tl 9 SbTe 6 . 
Though this compound was not included in the thermo¬ 
electric database, it scores highly within the recommen¬ 
dation engine, and good thermoelectric performance has 
been subsequently demonstrated in recent work® 

The suggestion of TaAlC> 4 , SrCrC> 3 , TaSbC >4 and other 
oxides expected to be insulators can be understood be¬ 
cause the recommendation engine uses as training data 
references where stoichiometric formulas were primar¬ 
ily reported rather than doping details.® 25 Nevertheless, 
with doping through substitution or reduction, these 
compound may exhibit moderate electrical performance. 
Further, these materials all feature extended structures 
that are highly connected in three dimensions, an impor¬ 
tant feature for low electrical resistivity. Moreover, the 
large mass contrast on the cation sublattice in TaAlC >4 
(edge shared TaO (l and A1C>6 octahedra) could lead to 
low thermal conductivity, and previous reports have 
shown that SrCrCH is metallic when synthesized under 
pressure® 

Many of the high-ranking candidate materials are in¬ 
teresting because of their highly connected extended 
structures, even though the recommendation engine 
does not use features of crystal structure to make its 
suggestions. The chief disadvantage to training predic¬ 
tion algorithms using crystal structure is that structure 
then becomes a required input for making predictions, 
and yet structure is by definition not available for un¬ 
characterized materials. However, the absence of crystal 
structure does cause our engine difficulty where changes 


in crystal structure with similar elemental compositions 
cause large changes in physical properties. For example, 
both DyP 04 and LaP0 4 are predicted to have low ther¬ 
mal conductivity. However, LaPC >4 is monazite, a corner 
edge-shared structure, whereas DyPC >4 is xenotime, 27 an 
edge-shared structure leading to inherently higher ther¬ 
mal conductivity®] 


New materials and their properties 

Our final and most important task in this work is to 
demonstrate that our recommendation engine can in¬ 
deed guide researchers toward interesting experimental 
discoveries. Among the set of high-scoring candidate ma¬ 
terials, we selected Er 12 Co 5 Bi and Gd 12 Co 5 Bi to char¬ 
acterize as thermoelectric materials due to their facile 
synthesis through arc melting, and due to the fact they 
are chemically quite distinct from known thermoelectrics 
(Fig. 0. While the /<7v| 2 Co 5 Bi (RE = rare earth) family 
of compounds has only been sparsely studied in the liter¬ 
ature, their crystal structure and initial low-temperature 
electrical and magnetic properties have been reported by 
Mar and coworkers® The crystal structure of f?£’i 2 Co 5 Bi 
is shown in Figure0 

Interestingly, the crystal structure of our candidate 
thermoelectric exhibits notable similarity to the struc¬ 
tures of known thermoelectrics, in spite of the fact 
that crystal structure was not an input feature for 
our recommendation engine. Hoi 2 Co 5 Bi is the epony¬ 
mous structure prototype (orthorhombic, space group 
Immm ) adopted by a series of rare-earth intermetallics 
2 Co 5 Bi (RE = Y, Gd,..., Tm). In this structure, the 
Hoi 2 Bi icosahedra play an analogous role to the LaPi 2 
icosahedra in the filled skutterudite prototype LaFe 4 Pi 2 ; 
rare-earth atoms “rattling” within their 12-fold coordi¬ 
nated cages is the idiosyncratic feature of filled skutteru- 
dites that imparts low thermal conductivity so prized in 
thermoelectric materials. In fact, if the transition metal 
atoms, which occupy different sites in these structures, 
are disregarded, the Hoi 2 Bi framework is an antitype to 
the LaPi 2 framework, with the roles of the rare-earth and 
group 15 elements reversed. We hypothesize its crys¬ 
tallographic similarity to skutterudite could be partly re¬ 
sponsible for the thermoelectric behavior of REi 2 Co 5 Bi 
(RE = Gd, Er). 

We give a full thermoelectric characterization of 
Eri 2 Co 5 Bi and Gdi 2 Co 5 Bi in Fig. [4] Based on these re¬ 
sults, we report the discovery of a new thermoelectric 
class, which remains a completely unoptimized, pure 
bulk material and thus lends itself to further study. No¬ 
tably, the material falls far outside the usual search space 
for thermoelectrics (Fig. 0, and was neither the result of 
simple interpolation between known compounds nor ob¬ 
vious from a strict chemical intuition standpoint. The 
electrical resistivity is commensurate with other high- 
performing materials such as chalcogenides, although 
the Seebeck coefficient is too low for the material to be 
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TABLE I. Several promising new thermoelectric compounds selected from our pre-computed list. The P values refer to the engine’s 
confidence level that a given material will exhibit a room-temperature value for a particular property (e.g., S or p ) within the 
target ranges specified above. The full compound list is available for exploration at http://thermoelectrics.citrination.com 


Material 


P s 


Composite Comments 


TaPOs and TaVOg 0.894 0.793 0.958 0.987 

3.537 

TlgSbTee 

0.845 0.871 0.999 0.876 

3.46 

TaA10 4 

0.893 0.703 1 0.977 

3.477 

SrCrO.i 

0.772 0.767 0.996 0.95 

3.308 

TaSbCH 

0.892 0.919 1 0.997 

3.559 


High polyhedral connectivity and structural superlattices 
Recently reported to be a good thermoelectric (zT « 1 at 600 K) 
High mass contrast, high polyhedral connectivity 
(edge- and corner-sharing TaO c octahedra) 

High polyhedral connectivity (3-D corner-sharing Cr06 octahedra), 
metallic when made under high pressure 

High polyhedral connectivity: layered, edge-sharing MOe octahedra 


(cl) RE 12 C OsBi 



FIG. 3. (a) Crystal structure of RE 12 C 05 B 1 (prototype H 012 C 05 BO, of which Eri^CosBi and GdioCosBi are exemplars, (b) Crystal 
structure of the filled skutterudites, which have the generic chemical formula AM 4 X 12 . These two structure types share an 
icosahedral motif consisting of RE 12 B 1 and AX) 2 units, respectively. 


competitive with the best-known thermoelectrics. Fur¬ 
thermore, the thermal conductivity is relatively high, 
but the filled cage structure lends itself to substitution 
that has successfully reduced thermal conductivity in the 
skutterudite systems.®® In REi 2 Co 5 Bi (RE = Gd, Er), 
the thermal conductivity from 300 K to 800 K ranges 
from 4Wm -1 K -1 to 8Wm^K _1 , comparable to the 
half-Heuslersfh-l22l Note that, as illustrated in Fig. [ 4 ] 
these results are in accord with the engine’s predic¬ 
tions; the models give a high probability of achieving 
the thresholds for electrical conductivity (a) and thermal 


conductivity (c) (see confidence bar insets), while also 
suggesting a low probability of observing a large See- 
beck coefficient (b). The electrical performance figure of 
merit kzT is around 0.03 Wm^ 1 K -1 at 400 K, which is 
actually higher than that of nearly 30% of the thermo¬ 
electrics in the Gaultois et al. thermoelectrics database,® 
of course, the database is a highly self-selected set of ma¬ 
terials, consisting of literature-reported thermoelectrics, 
and would skew toward much higher kzT values than 
would a random subset of all crystalline materials. We 
note, of course, that the zT of several other thermoelec- 
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|S|> lOOfaVK- 1 p< IO- 2 Dcm k< lOWm-'K-' 

FIG. 4. Thermoelectric characterization of i?i?i 2 Co 5 Bi (RE = Gd, Er). (a) Electrical resistivity, (b) Seebeck coefficient, (c) thermal 
conductivity, and (d) thermoelectric figure of merit zT as a function of temperature. We also include the recommendation engine’s 
confidence levels for the first three properties; the lowest-probability property, the Seebeck coefficient, is indeed found to be below 
the 100/rV/K threshold. 


trie materials can be significantly improved through car¬ 
rier concentration tuning and microstructural engineer¬ 
ing. For example, undoped polycrystalline Si has a 60- 
fold increase in performance after optimization, going 
from zT < 0.01 to 0.6 at 300 kP 

Another observation from Fig. [4] illustrates the sci¬ 
entific boon of studying entirely new classes of materi¬ 
als. Unexpectedly, AA'i^Co.-.Bi (RE = Gd, Er) exhibits 
increasing thermal conductivity with temperature. (We 
note the recommendation engine successfully chose a 
material with a low thermal conductivity at room tem¬ 
perature, which would normally decrease with increas¬ 
ing temperature.) The increasing electrical resistivity 
with temperature indicates metallic electrical transport, 
so the electrical contribution to the total thermal con¬ 
ductivity should therefore decrease with increasing tem¬ 
perature. Additionally, the phonon contribution to ther¬ 
mal conductivity should also decrease with increasing 
temperature due to more phonon-phonon (Umklapp) 
scattering.®] Thermal conductivity is calculated from the 
following relation: n = ap G' p , where a is thermal dif- 


fusivity, C p is heat capacity, and p is density. Normally, 
thermal diffusivity has a negative temperature depen¬ 
dence whereas heat capacity and density both have pos¬ 
itive temperature dependence. However, for this com¬ 
pound we observe a positive temperature dependence 
for the thermal diffusivity even after multiple measure¬ 
ments, the origin of which is not presently understood. 
Materials with increasing thermal conductivity with tem¬ 
perature are rare, though not unprecedented]®]®] and 
further studies on this class of compounds to shed light 
on this anomaly could thus lead to new strategies for 
thermoelectric materials optimization. 


IV. CONCLUSIONS 

This initial experimental validation of our recommen¬ 
dation engine is encouraging. The present work repre¬ 
sents the first time that machine learning has been used 
to suggest an experimentally viable new compound from 
true chemical white space, where no prior characteriza- 
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tion had hinted at promising chemistries. The implica¬ 
tion is that our approach-wherein a data-driven compu¬ 
tational tool directly augments experimental capabilities 
and intuition-is a semi-rational way to discover new ma¬ 
terials families that may have desirable properties. We 
suggest that such an paradigm could eventually replace 
trial-and-error and fortuity in the search for new materi¬ 
als across a wide variety of application areas. 
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